Microprocessor architecture having extendible logic

ABSTRACT

A microprocessor architecture having extendible logic. One or more customized applications are available to the instruction pipeline. The customizable applications may include software, extension logic instruction or register, dynamically configurable hardware logic, or combinations of these. In order to enable the operating system to interface with the customized extension applications, at least one software extension is provided to the operating system. When a specific extension is requested a software exception is generated by the OS. In response to the exception, the least one software extension is called to handle context switch and dynamic configuration of the extendible logic of the microprocessor.

CROSS REFERENCE TO RELATED APPLICATIONS

This application claims priority to provisional application No. 60/572,238 filed May 19, 2004, entitled “Microprocessor Architecture” hereby incorporated by reference in its entirety.

FIELD OF THE INVENTION

This invention relates generally to microprocessor architecture and more specifically to an improved architecture and mode of operation of a microprocessor having extendible logic.

BACKGROUND OF THE INVENTION

Extendible microprocessors have been developed which allow the user to add application-specific logic extensions to a base microprocessor design. This technique seeks to share the benefits of a standard microprocessor design and application-specific logic. By utilizing a standard microprocessor, a designer would seek to perform the majority of required functions by executing program code. The designer will benefit from the flexibility of this approach—the function of the design can be changed after hardware is produced by changing the program code. The designer will also benefit by having access to existing software infrastructure such as compilers, debuggers, operating systems and application code. By utilizing an application-specific logic design, a designer could gain the maximum performance for the application with the minimum logic area—however the function of the design will be fixed in hardware and no changes can be made once the hardware is produced.

The extendible microprocessor shares the benefits of these two approaches—special application-specific extension instructions, registers and interfaces are added to the design. As in the standard microprocessor case, the function of the design is still controlled by a software program, hence the function of the design can still be changed after hardware is produced; and software infrastructure designed for the base microprocessor can be used with extended variants of the processor. High performance can be obtained as key parts of the program are accelerated by adding the application-specific extension functions.

Some disadvantages of the previously described approaches still exist—in some cases software designed for the base microprocessor must be modified for use with an extended variant, this applies especially to operating systems (OS); once hardware is produced the function of the application-specific extensions cannot be changed, meaning that any changes to the function of the design must be achieved by changes to the program that sequences the base and extension operations.

The description herein of various advantages and disadvantages associated with known apparatus, methods, and materials is not intended to limit the scope of the invention to their exclusion. Indeed, various embodiments of the invention may include one or more of the known apparatus, methods, and materials without suffering from their disadvantages.

As background to the techniques discussed herein, the following references are incorporated herein by reference: U.S. Pat. No. 6,862,563 issued Mar. 1, 2005 entitled “Method And Apparatus For Managing The Configuration And Functionality Of A Semiconductor Design” (Hakewill et al.); U.S. Ser. No. 10/423,745 filed Apr. 25, 2003, entitled “Apparatus and Method for Managing Integrated Circuit Designs”; and U.S. Ser. No. 10/651,560 filed Aug. 29, 2003, entitled “Improved Computerized Extension Apparatus and Methods”, all assigned to the assignee of the present invention.

SUMMARY OF THE INVENTION

Thus, there exists a need for a microprocessor architecture which ameliorates and/or eliminates the above noted problems. In particular, there exists a need for a microprocessor architecture with reduced power consumption, improved performance and reduction of silicon footprint as compared with state of the art microprocessors. In addition, there exists a need to allow an operating system to be used with application-specific extensions added to an extendible processor. Furthermore, there exists a need to allow different sets of application-specific extensions to be used with a common operating system design, without requiring the user to make changes to that operating system.

In various embodiments, this is accomplished with a microprocessor architecture in which extendible logic may be added to the microprocessor through fixed-function hardware or programmable logic extensions. An extendible microprocessor may be configured to provide customizable operation specific logic. These operation specific logic blocks, known as extensions, may be implemented as fixed-function logic. These extensions may also be implemented through the addition of a block of dynamically configurable logic connected to the extendible processor. However, such logic may not and should not be available to all other processes. Thus, an extension register disable/enable function will be available to the OS to selectively disable or enable the access to extension registers and instructions. In various exemplary embodiments, when switching between processes the state of the current extendible logic must be stored for future calls to the same process. The operating system, because it was written prior to the extendible logic, will likely not understand how to handle requests for particular extendible operations or switches between operations. In various embodiments, a series of software extensions, analogous to device drivers, are installed to the OS which define properties such as the instruction and register locations associated with each extension and how that extension may be enabled/disabled. Also, recording all of the state information of a particular extendible logic configuration is very costly in terms of time. Therefore, in various embodiments, a technique known as lazy context switching is employed to perform context switches of extension state only when strictly necessary. When a task is switched out, access to all extension state is disabled but the contents of that extension state are not saved. After switching in a new task, when access to a specific extension is requested an exception is generated as access to that extension is disabled. In various exemplary embodiments, the OS calls the software extensions (drivers) to determine whether the exception is a genuine error, or a valid request to extension logic. When a task has made a valid request to use a disabled extension, the OS determines which task last used the disabled extension. If the last use was by the requesting task, no context switch is necessary. If the extension was last used by a different task, a context switch is performed. After this, the extension is enabled until the next task switch—when it is once more disabled.

At least one embodiment provides a method of selectively providing dynamically configurable extendible microprocessor logic supported by an operating system executing on the microprocessor. The method of selectively providing dynamically configurable extendible microprocessor logic supported by an operating system executing on the microprocessor according to this embodiment may comprise providing at least one block of dynamically configurable extension logic and at least one software extension to the operating system. On a software exception generated when a specific extension logic is requested, the operating system calls the least one software extension to handle dynamic configuration of the extendible logic of the microprocessor.

In this embodiment, all extensions are disabled on a task switch. When access to a specific extension is requested, an exception is generated as access to that extension is disabled. In various exemplary embodiments, the OS will call the software extensions (drivers) in order to determine whether the exception is a genuine error, or a valid request to extension logic. In addition to providing for lazy context switch, the OS calls the software extensions (drivers) to determine whether the extension logic expected by the task is loaded into the dynamically configurable extension logic block. If the required extension is loaded, no further action is necessary. If the extension is not loaded, the software extension controls loading of the required extension into the dynamically configurable extension logic block. This method is known as a virtual hardware extensions scheme due to its ability to dynamically configure (swap in) different extension logic hardware as required by any particular task.

At least one additional embodiment provides a method of selectively providing context switch support for extendible microprocessor logic to an operating system executing on the microprocessor. The method of selectively providing context switch support for extendible microprocessor logic to an operating system executing on the microprocessor according to this embodiment may comprise providing at least one software extension to the operating system, the software extension defining properties including instruction and register locations of the extendible logic and how that extendible logic may be enabled/disabled, generating a software exception when a specific extendible logic is requested, and in response to the exception, calling with the operating system the least one software extension to perform context switching of data contained within the specific extendible logic.

Another embodiment of the invention provides a method of selectively providing dynamically configurable extendible microprocessor logic to an operating system executing on the microprocessor. The method of selectively providing dynamically configurable extendible microprocessor logic to an operating system executing on the microprocessor according to the embodiment may comprise providing at least one software extension to the operating system, generating a software exception when a specific extension logic is requested, and in response to the exception, calling with the operating system the least one software extension to handle dynamic configuration of the extendible logic of the microprocessor.

A further embodiment of the invention provides an extendible microprocessor. The extendible microprocessor according to this embodiment may comprise a microprocessor having a multi-stage instruction pipeline, and an extension interface to the instruction pipeline adapted to complement a standard microprocessor instruction set with customized processor instructions and registers.

Still another embodiment of the invention provides a microprocessor architecture. The microprocessor architecture according to this embodiment may comprise a multi-stage instruction pipeline, an instruction extension interface for interfacing with at least one stage of the instruction pipeline, a plurality of extension applications available to an operating system through the instruction pipeline, a required extension determining circuit, routine or application for identifying instructions in the pipeline that require one of the plurality of extension applications, and an extension register disable/enable circuit, routine or application available to the operating system for selectively enabling and disabling extension applications.

Other aspects and advantages of the invention will become apparent from the following detailed description, taken in conjunction with the accompanying drawings, illustrating by way of example the principles of the invention.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a block diagram illustrating a processor core in accordance with at least one exemplary embodiment of this invention; and

FIGS. 2 and 3 are block diagrams illustrating a microprocessor architecture utilizing operating system supported virtual hardware extensions in accordance with at least one exemplary embodiment of this invention.

DETAILED DESCRIPTION OF THE DISCLOSURE

The following description is intended to convey a thorough understanding of the invention by providing specific embodiments and details involving various aspects of a new and useful microprocessor architecture. It is understood, however, that the invention is not limited to these specific embodiments and details, which are exemplary only. It further is understood that one possessing ordinary skill in the art, in light of known systems and methods, would appreciate the use of the invention for its intended purposes and benefits in any number of alternative embodiments, depending upon specific design and other needs.

Discussion of the invention will now made by way of example in reference to the various drawing figures. FIG. 1 illustrates in block diagram form, an architecture for a microprocessor core 100 and peripheral hardware structure in accordance with at least one exemplary embodiment of this invention. Several novel features will be apparent from FIG. 1 which distinguish the illustrated microprocessor architecture from that of the conventional microprocessor architecture. Firstly, the microprocessor architecture of FIG. 1 features a processor core 100 having a seven stage instruction pipeline. An align (ALN) stage 120 is shown in FIG. 1 following the fetch (FET) stage 110. Because the microprocessor core 100 shown in FIG. 1 is operable to work with a variable bit-length instruction set, namely, 16-bits, 32-bits, 48-bits or 64-bits, the align stage 120 formats the words coming from the fetch stage 110 into the appropriate instructions.

In various exemplary embodiments, instructions are fetched from memory in 32-bit words. Thus, when the fetch stage 110 fetches a 32-bit word at a specified fetch address, the entry at that fetch address may contain an aligned 16-bit or 32-bit instruction, an unaligned 16 bit instruction preceded by a portion of a previous instruction, or an unaligned portion of a larger instruction preceded by a portion of a previous instruction based on the actual instruction address. For example, a fetched word may have an instruction fetch address of Ox4, but an actual instruction address of Ox6. In various exemplary embodiments, the 32-bit word fetched from memory is passed to the align stage 120 where it is aligned into an complete instruction. In various exemplary embodiments, this alignment may include discarding superfluous 16-bit instructions or assembling unaligned 32-bit or larger instructions into a single instructions. After completely assembling the instruction, the N-bit instruction is forwarded to the decoder 130.

Still referring to FIG. 1, an instruction extension interface 180 is also shown which permits interface of customized processor instructions that are used to complement the standard instruction set architecture of the microprocessor. Interfacing of these customized instructions occurs through a timing registered interface to the various stages of the microprocessor pipeline 100 in order to minimize the effect of critical path loading when attaching customized logic to a pre-existing processor pipeline. Specifically, a custom opcode slot is defined in the extensions instruction interface for the specific custom instruction in order for the microprocessor to correctly acknowledge the presence of a custom instruction 182 as well as the extraction of the source operand addresses that are used to index the register file 142. The custom instruction flag interface 184 is used to allow the addition of custom instruction flags that are used by the microprocessor for conditional evaluation using either the standard condition code evaluators or custom extension condition code evaluators in order to determine whether the instruction is executed or not based upon the condition evaluation result of the execute stage (EXEC) 150. A custom ALU interface 186 permits user defined arithmetic and logical extension instructions the result of which are selected in the result select stage (SEL) 160.

FIGS. 2 and 3 are block diagrams illustrating a microprocessor architecture utilizing operating system supported virtual hardware extensions in accordance with at least one exemplary embodiment of this invention. A microprocessor in accordance with at least one embodiment of this invention is capable of logic extension by a user by way of an interface by which extension instructions and registers may be added. Certain slots in the processor's instruction set and register map are reserved to functions reached through this interface. By this interface, special instructions including digital signal processor (DSP) instructions, state machine and related machine state may be extended to the processor in order to make available and/or accelerate applications critical to implementers (customers) of the microprocessor.

In various exemplary embodiments, this extension interface may be used to add fixed function logic, as in an application specific integrated circuit (ASIC), or may be connected to an array of programmable logic (FPGA or PLD block). When connected to an array of programmable logic, it will be possible to dynamically change the behavior of the extension functions based on the current needs of the system. This embodiment of the invention provides a system in which an array of programmable logic is connected to the extension interface of a microprocessor, and the microprocessor has a mechanism by which it can reconfigure the programmable logic array based on context. As result, the CPU can dynamically reconfigure the extension instructions/registers available to programs running on the processor.

A problem with implementing an interface between an extendible processor and an operating system (OS) is that the OS is usually codified before the processor and any extension software interface has been created. As a result, the OS is incapable of dealing with these extensions without the assistance of drivers or other software routines. FIGS. 2 and 3 illustrate the basic components of a system for in accordance with this embodiment of the invention. FIG. 2 illustrates a microprocessor core 200 that includes a dynamically reconfigurable extension instruction/register logic accessed through an extension interface 210. The extension interface 210 communicates with a software interface 220 of the operating system in response to demands of the operating system. FIG. 3 illustrates a system 300, which can be dynamically reconfigured under the control of the microprocessor. The system 300 includes an extendible processor 310, a required extension determining routine or application 320, an extension register disable/enable routine or application 330, an available extension routine or application 340 and one or more extension applications 350, all interconnected by software interface 360.

In various exemplary embodiments, the required extension determining routine or application 320 will inform the OS which of the possible extension functions are required by a particular program or user application. In various exemplary embodiments, this could take the form of an extension to the ELF executable format. The extension register disable/enable routine or application 330 permits the OS to enable or disable access to extension instructions or registers, either individually or as a group. If an access is made or attempted while the OS has disabled the extension function, an exception is taken. In various exemplary embodiments, the available extension routine or application 340 informs the OS about an available extension function or group of functions. In various exemplary embodiments, this will include configuration information about the extension, including at least which extension instruction slots are used, which extension register slots are used and names of instructions and register used for disassembly and debug. In various exemplary embodiments, this will also include a software function by which the OS can determine if the required extension function is already built into the microprocessor, a software model of the extension to allow software emulation if the extension is not available in hardware, and optionally, a configuration file containing the required extension for reconfigurable hardware connected to the extension interface.

In various exemplary embodiments, before running an application, the user or administrator would register (install) available extension functions with the OS, analogous to a virtual device under Microsoft WINDOWS™. Also, because context switching between extensions requires storing state information of the extendible hardware, a technique known as lazy context switching is used to manage context switching of both machine state stored in both fixed function extensions and dynamically reconfigurable extensions

In various exemplary embodiments, when a user application is executed, no extension functions are loaded or enabled. The OS will check to ensure that an extension function specified by the program has been registered. Any access to an extension function (e.g., register read or write, extension instruction execution) by the program will cause an exception—either an illegal instruction exception or an extension disabled extension. In various exemplary embodiments, this will cause the OS to check to see if the instruction exception is caused by an access to an extension function that is required by the program (as specified in the ELF file). Otherwise, the error is a true illegal instruction error and the OS takes its normal error handling action.

In various exemplary embodiments, if the extension to which access is being attempted is specified as being required by the program as determined by the required extension determining routine or application 320, the OS may take one of the following courses of action. If the extension is already built into the microprocessor, the extension (or extension group containing the function) can be enabled. Before the program is allowed to continue, it may be necessary to context switch any state stored in the extension (lazy context switch) if the extension has been in use by some other task. This technique is often used by an OS to minimize the context switch time. Alternatively, if the extension required is available as a dynamically reconfigurable block, the OS must determine whether the extension is already loaded using the available extension routine or application 340. If so, the OS can then check to see if any state in the extension needs to be context switched. If the extension is not loaded, the OS must save the machine state from an extension configuration that is loaded prior to loading the required extension configuration into the dynamically reconfigurable block. The extension (or extension group containing the function) can then be enabled and the program allowed to continue. Finally, if the extension required is not available as a built-in function, or as a dynamically reconfigurable block, the appropriate emulation code (as registered with the OS) can be called to perform the required function in the absence of an existing hardware extension.

The above described embodiment, by making extensions to the OS error handling code and to the executable file format, provides unexpected benefits including, that a common OS can be used on many different configured microprocessors, different executable programs complied for a common Instruction Set Architecture (ISA), but using different sets of extension functions can be run under a multi-tasking OS on any microprocessor using the using the common ISA using software emulation of extension functions, or built-in extension functions themselves. Also, different executable programs compiled for a common ISA, but using different sets of extension functions can be run at speed under a multi-tasking OS on any microprocessor using the common ISA, when that microprocessor features dynamically reconfigurable extension logic. As a result, this could provide a powerful development platform for extension functions by which extension functions can be prototyped in software or programmable logic while also running in a real OS-based environment. Also, this could be used to deliver a field-upgradeable system whereby a new piece of software would be accompanied by hardware acceleration for key functions of that software, making use of dynamically reconfigurable hardware. Existing functions using other extension functions would still be able to operate as the OS can seamiessly manage which extension functions must be preset n it he dynamically reconfigurable hardware for each task.

While the foregoing description includes many details and specificities, it is to be understood that these have been included for purposes of explanation only, and are not to be interpreted as limitations of the present invention. Many modifications to the embodiments described above can be made without departing from the spirit and scope of the invention. 

1. In a microprocessor, a method of selectively providing context switch support for extendible microprocessor logic to an operating system executing on the microprocessor, the method comprising: providing at least one software extension to the operating system, the software extension defining properties including instruction and register locations of the extendible logic and how that extendible logic may be enabled/disabled; generating a software exception when a specific extendible logic is requested; and in response to the exception, calling with the operating system the at least one software extension to perform context switching of data contained within the specific extendible logic.
 2. The method according to claim 1, further comprising: providing means to the operating system to selectively disable or enable access to extendible logic.
 3. The method according to claim 2, wherein providing means to the operating system to selectively disable or enable access to extendible logic comprises providing an extension register disable/enable application to the operating system.
 4. The method according to claim 1, further comprising providing means for switching between software processes using the extendible logic functions.
 5. The method according to claim 4, wherein providing means for switching between software processes using the extendible logic functions comprises providing means for performing lazy context switching.
 6. The method according to claim 5, wherein providing means for performing lazy context switching comprises providing means for storing a state of the data contained within the current extendible logic comprises performing lazy context switching.
 7. In a microprocessor, a method of selectively providing dynamically configurable extendible microprocessor logic to an operating system executing on the microprocessor, the method comprising: providing at least one software extension to the operating system; generating a software exception when a specific extension logic is requested; and in response to the exception, calling with the operating system the least one software extension to handle dynamic configuration of the extendible logic of the microprocessor.
 8. The method according to claim 7, further comprising: providing means to the operating system to selectively disable or enable access to the extendible logic including extension registers and instructions
 9. The method according to claim 8, wherein providing means to the operating system to selectively disable or enable access to extension registers and instructions comprises providing an extension register disable/enable application to the operating system to selectively disable or enable access to extendible logic including extension registers and instructions.
 10. The method according to claim 7, further comprising providing means for switching between software processes using the extendible logic functions.
 11. The method according to claim 10, wherein providing means for switching between software processes using extendible logic functions comprises storing a state of the data contained within current extendible logic for future calls to the same process.
 12. The method according to claim 11, wherein storing a state of the data contained within the current extendible logic comprises performing lazy context switching.
 13. The method according to claim 10, wherein providing means for switching between software processes using extension logic functions comprises storing a state of the contents of a reconfigurable logic block containing extendible logic for future calls to the same process.
 14. The method according to claim 10, wherein storing a state of the contents of a reconfigurable logic block containing extendible logic comprises performing lazy context switching.
 15. An extendible microprocessor comprising: a microprocessor having a multistage instruction pipeline; and an extension interface to the instruction pipeline adapted to complement a standard microprocessor instruction set with customized processor instructions and registers.
 16. The extendible microprocessor according to claim 15, further comprising means for selectively disabling/enabling the customized instructions and registers.
 17. The extendible microprocessor according to claim 16, wherein the means for selectively disabling/enabling instructions and registers comprises an extension register disable/enable application provided to an operating system executing on the microprocessor.
 18. The extendible microprocessor according to claim 15, further comprising means for switching between customized instructions and registers.
 19. The method according to claim 18, wherein means for switching comprises means for storing a state of the current instructions and registers for future calls to the same process.
 20. The method according to claim 19, wherein means for storing comprises means for performing lazy context switching.
 21. The extendible microprocessor according to claim 18, wherein means for switching between customized processor instructions comprises means for reconfiguring a block reconfigurable logic connected to the extension interface.
 22. The method according to claim 21, wherein means for switching comprises means for storing a state of the reconfigurable logic area for future calls to the same process.
 23. The method according to claim 22, wherein means for storing comprises means for performing lazy context switching.
 24. A microprocessor comprising: a multistage instruction pipeline; an instruction extension interface for interfacing with at least one stage of the instruction pipeline; a plurality of extension applications available to an operating system through the instruction pipeline; a required extension determining circuit, routine or application for identifying instructions in the pipeline that require one of the plurality of extension applications; and an extension register disable/enable circuit, routine or application available to the operating system for selectively enabling and disabling extension applications.
 25. The microprocessor according to claim 24, further comprising means for storing the state of the current extendible logic when switching between extension applications for future calls to the same extension application.
 26. The microprocessor according to claim 24, wherein the plurality of extension applications comprises a plurality of operation specific logic blocks.
 27. The microprocessor according to claim 24, wherein the plurality of extension applications comprise a block of dynamically configurable logic operable to be configured to perform a plurality of different functions.
 28. The microprocessor according to claim 24, further comprising logic for determining whether an extension application requested by a current task is loaded into the dynamically configurable extension logic block.
 29. The microprocessor according to claim 28, further comprising logic for loading of the required extension into the dynamically configurable extension logic block.
 30. The microprocessor according to claim 24, wherein the plurality of extension applications comprises instruction and register locations associated with each extension, and how each extension may be enabled/disabled.
 31. The microprocessor according to claim 24, further comprising logic for determining, upon receiving a request from a task to use an extension application from a task, if the last use of the application was by the requesting task.
 32. The microprocessor according to claim 31, further comprising logic for performing a context switch if the extension was last used by a different task than the requesting task. 